Higher order image pyramids: an early visual representation
نویسنده
چکیده
The scale invariant property of an ensemble of natural images is examined, which motivates a new early visual representation termed the higher order pyramid. The representation is a non-linear generalization of the Laplacian pyramid, tuned to the type of scale invariance exhibited by natural imagery as opposed to other scale invariant images such as 1/f correlated noise and the step edge. The transformation of an image to a higher order pyramid is simple to compute and straightforward to invert. Because the representation is invertible it is shown that the higher order pyramid can be truncated and quantized with little loss of visual quality. Images coded in this representation have much less redundancy than the raw image pixels and decorrelating transformations such as the Laplacian pyramid. This is demonstrated by showing statistical independence between pairs of coefficients. The representation is tuned to the ensemble redundancies, and hence the coefficients of the higher order pyramid are more efficient at capturing the variation within the ensemble which leads to improved matching results. This is demonstrated on several recognition tasks: object recognition with viewpoint changes, object recognition with scale changes, and face recognition with illumination changes.
منابع مشابه
Higher Order Image Pyramids
The scale invariant property of an ensemble of natural images is examined which motivates a new early visual representation termed the higher order pyramid. The representation is a non-linear generalization of the Laplacian pyramid and is tuned to the type of scale invariance exhibited by natural imagery as opposed to other scale invariant images such as 1/f correlated noise and the step edge. ...
متن کاملDBRIS at ImageCLEF 2012 Photo Annotation Task
For our participation in the ImageCLEF 2012 Photo Annotation Task we develope an image annotation system and test several combinations of SIFT-based descriptors with bow-based image representations. Our focus is on the comparison of two image representation types which include spatial layout: the spatial pyramids and the visual phrases. The experiments on the training and test set show that ima...
متن کاملExplorative Analysis of Graph Pyramids Using Interactive Visualization Techniques
Hierarchies of plane graphs, called graph pyramids, can be used for collecting, storing, and analyzing geographical information based on images or other input data. The visualization of graph pyramids facilitates studies about their structure, such as their vertex distribution or height in relation of a specific input image. Thus, a researcher can debug contraction processes and ask for statist...
متن کاملOpen Issues and Chances for Topological Pyramids * Pattern Recognition and Image Processing Group
High resolution image data require a huge amount of computational resources. Image pyramids have shown high performance and flexibility to reduce the amount of data while preserving the most relevant pieces of information, and still allowing fast access to those data that have been considered less important before. They are able to preserve an existing topological structure (Euler number, homol...
متن کاملA Novel Concept for Smart Camera Image Stitching
As panoramic images are widely used in many applications, efficient image stitching methods that provide visually pleasant image mosaics are needed. In this paper we discuss a novel concept for smart camera image stitching based on graph pyramids. For a multi-camera system, the images have to be aligned accordingly to create an image mosaic. Instead of calculating the corresponding transformati...
متن کامل